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We address the problem of properly quantifying information in quantum theory. Brukner and 
Zeilinger proposed the concept of an operationally invariant measure based on measurement statis¬ 
tics. Their measure of information is calculated with probabilities generated in a complete set of 
mutually complementary observations. This approach was later criticized for several reasons. We 
show that some critical points can be overcome by means of natural extension or reformulation of 
the Brukner-Zeilinger approach. In particular, this approach is connected with symmetric informa¬ 
tionally complete measurements. The “total information” of Brukner and Zeilinger can further be 
treated in the context of mutually unbiased measurements as well as general symmetric informa¬ 
tionally complete measurements. The Brukner-Zeilinger measure of information is also examined 
in the case of detection inefficiencies. It is shown to be decreasing under the action of bistochas¬ 
tic maps. The Brukner-Zeilinger total information can be used for estimating the map norm of 
quantum operations. 

Keywords: Brukner-Zeilinger information, complementary measurements, bistochastic maps 


I. INTRODUCTION 

Quantum information science has currently made impressive advances in both theory and practice [1]. Feynman 
emphasized that quantum systems are very hard to be simulated at the classical level [2]. On the other hand, such 
negative claim also inspires a positive reason for trying to build quantum computers [3]. Quantum key distribution 
has provided a long-term technological solution already implemented in a lot of commercial products [4, 5]. Quantum 
algorithms allow to solve efficiently a number of important problems, which are currently intractable [6, 7]. Devel¬ 
opments in quantum information processing stimulated a renewed interest to foundations of quantum mechanics. 
This subject is a thriving, lively and controversial field of research [8, 9]. Currently, conceptual questions are often 
reformulated in information-theoretic terms. Actually, results of a quantum measurement are finally recorded in some 
row of statistical data. Hence, we have come across a problem to quantify an amount of information that could be 
extracted from such data. 

The problem of determining quantum state quite differs from the classical formulation. There are many possible 
scenarios to be imagined. Attacking a system of quantum key distribution, Eve is typically bused with discriminating 
between two or more alternatives known to her a priori. During an individual attack, she captures only a single 
information carrier. An opposite situation deals with a very large ensemble of identical copies. In practice, a number 
of copies is never infinite though large. Our experience leads to the following conclusion. The proportion of times 
that the given outcome occurs settles down to some value as number of trials becomes larger and larger. The ultimate 
value of this proportion is meant as the probability of the given outcome. Dealing with quantum systems, the observer 
can take different experiments, which might even completely exclude each other. For example, the state of a spin-1/2 
system is often considered to be estimated with measurements of the three orthogonal components of spin [10]. In 
more than two dimensions, such complementary measurements are formulated in terms of the so-called mutually 
unbiased bases (MUBs). This concept was actually considered by Schwinger [11]. 

To approach properly an informational measure, Brukner and Zeilinger considered the following situation [10]. 
Suppose that we know probabilities of all outcomes and try to guess a number of occurrences of the prescribed 
outcome among finite experimental trials. Of course, our prediction will allow an amount of uncertainty, which can 
be estimated with respect to some confidence interval. Taking an uncertainty per single trial and summing it for all 
outcomes, one naturally leads to a measure of uncertainty in one experiment [10]. It is shown to be 1 minus the sum 
of squared probabilities. Hence, Brukner and Zeilinger defined a measure of information in one experiment and in a 
set experiments. For d + 1 MUBs, the corresponding total information turned to be operationally invariant in the 
following sense [10]. The sum of the individual measures of information for mutually complementary observations is 
invariant with respect to a choice of the particular set of complementary observations. In other words, this sum is 
invariant under unitary rotations of the measured state. The latter implies that there is no information flow between 
the system of interest and its environment [10]. 

Mutually unbiased bases are an interesting mathematical object as well as an important tool in many physical issues 
[12]. Such bases can be used in quantum key distribution, state reconstruction, quantum error correction, detection 
of quantum entanglement, and other topics. Mutually unbiased bases are connected with symmetric informationally 
complete measurements. A positive operator-valued measure (POVM) is said to be informationally complete, if 
its statistics determine completely the quantum state [13, 14]. To increase an efficiency at determining the state, 
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elements of such a measurement should have rank one. An informationally complete POVM is called symmetric, 
when all pairwise inner products between the POVM elements are equal [15]. In general, the maximal number of 
MUBs in d dimensions is still an open question [12]. When d is a prime power, the answer d + 1 is known [12]. 
Constructions of d + 1 MUBs for such d rely on properties of prime powers and on an underlying finite field [16]. It 
also seems to be hard to get a unified way for building a symmetric informationally complete POVM (SIC-POVM) 
in all dimensions. 

The authors of [17] introduced the concept of mutually unbiased measurements. The core idea is that elements of 
such a measurement are not rank one. This method does not reach the maximal efficiency but is easy to construct. 
It turns out that a complete set of d + 1 mutually unbiased measurements can be built explicitly for arbitrary finite 
d [17]. An utility of such measurements in quantum information science deserves further investigations. It is also 
unknown whether rank-one SIC-POVMs exist in all finite dimensions. The positive answer was obtained with a weaker 
condition that POVM elements are not rank one. The authors of [18] proved the existence of general SIC-POVMs in 
all finite dimensions. It is not insignificant that general SIC-POVMs can be constructed within a unified approach. 
Studies of mutually unbiased measurements and general SIC-POVMs were continued in [19-22]. We will show that 
these measurements are interesting in the context of the Brukner-Zcilinger approach [10, 23-25]. This approach 
to quantifying an amount of information will be shown to be realizable within three additional types of quantum 
measurements. 

The paper is organized as follows. In Section II, preliminary material is reviewed. In particular, we recall the 
definitions of mutually unbiased measurements and general SIC-POVMs. Section III is devoted to a general discussion 
of the Brukner-Zeilinger approach to quantification of information in quantum measurements. Its treatment in terms 
of Tsallis’ entropies of degree 2 is mentioned. In Section IV, we show that an operationally invariant measure of 
information can be approached within the three measurement schemes. They are respectively based on a single SIC- 
POVM, on a set of d +1 mutually unbiased measurements, and on a general SIC-POVM. These measurement schemes 
give an alternative to d + 1 MUBs known only for prime power dimensions. In Section V, the Brukner-Zeilinger 
approach is examined for the case of detection inefficiencies, when the “no-click” events are allowed. In Section VI, we 
show that the Brukner-Zeilinger total information cannot increase under the action of bistochastic maps. Relations 
between the Brukner-Zeilinger approach and non-unitality are examined in Section VII. In Section VIII, we conclude 
the paper with a summary of results. 


II. PRELIMINARIES 

In this section, we review the required material on mutually unbiased measurements and general SIC-POVMs. Let 
C(Hd) be the space of linear operators on d-dimensional Hilbert space kid- By £ + ('Hd), we denote the set of positive 
semidefinite operators on Hd- By C s . a . (T~Ld), we mean the d 2 -dimensional real space of Hermitian operators on kid- 
A state of d -level system is represented by density operator p £ C+CHd) normalized as tr(p) = 1. For operators 
X,Y £ C^Hd), their Hilbert-Schmidt inner product is defined by [26] 

(X,Y)hs :=tr(XtY) . (2.1) 

Quantum measurements are commonly dealt in terms of the POVM formalism [27]. We consider a set of elements 
A j £ C + (Hd) such that the completeness relation holds, namely 

E/i = 1L <*- ( 2 - 2 ) 

Here, the 1 d denotes the identity operator on kid- The set A = {Aj} is called a (POVM). For pre-measurement state 
p, the probability of j -th outcome is written as [27] 

pAAp) = tr(Aj-p) . (2.3) 

It is of key importance that the number of different outcomes can be more than the dimensionality of kid [27]. Of 
course, in practice POVM measurements involve auxiliary systems, so that degrees of freedom are actually added. 

Let £>W = (|6^)} and = ||&^)} be two orthonormal bases in kid- They are mutually unbiased if and only 
if for all j and k, 

| ( &?W )} |=^- (2.4) 

The set B = [B ^\... is formed by mutually unbiased bases (MUBs), when each two bases from this set 

are mutually unbiased. The measurement in one basis cannot give anything about the state, which was prepared in 
another basis. This property is essential in some schemes of quantum key distribution. 
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Let us recall symmetric informationally complete measurements. In d- dimensional Hilbert space, we consider a set 
of d 2 rank-one operators of the form 

(2-5) 

If the normalized vectors \<j>j) all satisfy the condition 

(J ± k) , ( 2 . 6 ) 

the set A f = {Nj} is a symmetric informationally complete POVM (SIC-POVM) [15]. It was conjectured that SIC- 
POVMs exist in all dimensions [28]. The existence of SIC-POVMs has been shown analytically or numerically for all 
dimensions up to 67 [29]. For a discussion of connections between MUBs and SIC-POVMs, see [30] and references 
therein. Weyl Heisenberg (WH) covariant SIC-sets of states in prime dimensions are examined in [31]. WH SIC-sets, 
whenever they exist, consist solely of minimum uncertainty states with respect to Renyi’s 2-entropy for a complete 
set of MUBs [31]. The authors of [32] derived bounds on accessible information and informational power for the case 
of SIC-sets of quantum states. In general, informationally complete sets of positive matrices are discussed in the book 
[33]. The authors of [34] discussed approximate versions of a SIC-POVM, when a small deviation from uniformity of 
the inner products is allowed. 

Basic constructions of MUBs concern the case, when d is a prime power. If d is another composite number, maximal 
sets of MUBs are an open problem [12]. We can try to approach “unbiasedness” with weaker conditions. The authors 
of [17] proposed the concept of mutually unbiased measurements. They consider two POVM measurements V = { Py } 
and Q = {Qfe}- Each of them contains d elements such that 

tr(P j) = tr(Q fe ) = 1 , (2.7) 

tr(PjQfc) = ^ . (2.8) 

Thus, the POVM elements are all of trace one, but now not of rank one. The formula (2.8) replaces (2.4). The 
Hilbert-Schmidt product of two elements from the same POVM depends on the so-called efficiency parameter x [17]. 
It holds that 

tr(PjPfc) = 5jkx+ (1 - 5jk) ^ ! > (2-9) 

and similarly for the elements of Q. The efficiency parameter obeys [17] 

-,<x< 1 . ( 2 . 10 ) 

d 

For x = 1/d we have the trivial case, in which Pj = 1 a/d for all j. The value x = 1, when possible, leads to the 
standard case of mutually unbiased bases. More precise bounds on x will depend on a construction of measurement 
operators. The efficiency parameter shows how close the measurement operators are to rank-one projectors [17]. For 
the given x, we take the set P = ... , plC} of POVMs satisfying (2.9). When each two POVMs also obey 

conditions of the forms (2.7) and (2.8), the set P is a set of mutually unbiased measurements (MUMs). Allowing 
x ^ 1, the authors of [17] built d + 1 MUMs in d-dimensional Hilbert space for arbitrary d. Their construction is 
based on the generators of SU(d). For the given d, the parameter x ranges in the interval, which is determined by 
the smallest or largest eigenvalues of some traceless operators. In this regard, we cannot fix x without specifying d. 
Of course, the efficiency parameter should approach 1 as close as possible. 

Similar ideas can be used in building general SIC-POVMs. For all finite d, a common construction has been given 
[18]. Consider a POVM with d 2 elements Mj, which satisfy the following two conditions. First, for all j = 1 ,d 2 
one has 

tr(MjMj) = a . (2.11) 

Second, the pairwise inner products are all symmetrical, namely 

tr(MjMfe) = b (j^k). (2.12) 

Then the operators Mj form a general SIC-POVM. Combining the conditions (2.11) and (2.12) with the completeness 
relation finally gives [18] 


1 — ad 

= d(d 2 - 1 ) ' 


(2.13) 
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We also get tr(Mj) = 1 /d for all j = 1 ,,d 2 . Therefore, the value a is the only parameter that characterizes the 
type of a general SIC-POVM. This parameter is restricted as [18] 

~cP <a ~d?' (2 ' 14) 

The value a = 1/d 3 corresponds to the case Mj = 1 d/d 2 , which does not give an informationally complete POVM. 
The value a = 1/d 2 is achieved, when the POVM elements are all rank-one [18]. The latter is actually the case of 
usual SIC-POVMs, when POVM elements are represented in terms of the corresponding unit vectors as (2.5). Even 
if SIC-POVMs exist in all dimensions, they are rather hard to construct. Similarly to usual SIC-POVMs, general 
SIC-POVMs have a specific structure that makes them appropriate in determining an informational content of a 
quantum state. 

In Section V, we will use monotonicity of the relative entropy under the action of trace-preserving completely 
positive (TPCP) maps. So, we recall some required material. Let us consider a linear map 

$ : £(H d ) C(U m ) . (2.15) 

To describe physical processes, linear maps have to be completely positive [35, 36]. Let id n be the identity map on 
C(Un), where the ?r-dimensional space T-L n is assigned to a reference system. The complete positivity implies that the 
map (g> id„ is positive for all n. Completely positive maps are often called quantum operations [35]. Each completely 
positive map can be represented in the form [26, 35] 

$ ( x ) = E i K * XK i • (2- ie ) 

Here, the Kraus operators K, map the input space Hd to the output space "H m . The map preserves the trace, when 
the Kraus operators satisfy 


E K ! K i = 1 ^ (2-!7) 

Trace-preserving quantum operations are usually referred to as quantum channels. Applying to a POVM measurement, 
the formula (2.17) merely gives the completeness relation [35]. 


III. ON DEFINITION OF THE BRUKNER—ZEILINGER INFORMATION 

Quantum theory can shortly be characterized as a formal scheme for representing states together with rules for 
computing the probabilities of different outcomes of an experiment [27]. In this regard, the notion of quantum state 
is rather a list of the statistical properties of an ensemble of identically prepared systems. In a series of papers 
[10, 23-25], Brukner and Zeilinger considered the question of informational content of an unknown quantum state. To 
quantify the amount of information, a prospective measure should have some natural properties. These properties are 
also connected with a proper choice of individual experiments or rather a set of experiments. Choosing experiments, 
the observer can actually manage different kinds of information that will manifest themselves, although the total 
amount of information is apparently limited [8]. 

Let us consider an experiment, in which a non-degenerate d-dimensional observable is measured. This test is actually 
connected with the corresponding basis B = {|^)}- As a rule, the observer has only a limited number of systems 
to work with. Keeping the probability distribution pj(B\p) = (bj\p\bj), the observer try to guess how many times 
a specific outcome will occur. In such situation, the number of occurrences of some outcome in future repetitions 
cannot be expected precisely [10]. The authors of [10] suggested to characterize the experimenter’s uncertainty by 
the quantity 


Ubz(B\ P ) :=1-E- =1 P ^ B \ p)2 • (3 ' 1} 

This approach is motivated with considering mean-square-deviation for uncertainty in the number of occurrences. It 
will be convenient to introduce the index of coincidence 

C(B\p) :=J2 d j=1 PMp) 2 • 


(3.2) 
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We then have Ubz(B\p) = 1 — C(B\p). The case of complete lack of information in an experiment corresponds to the 
uniform distribution. Hence, Brukner and Zeilinger proposed to define the measure of information as [10, 23] 

d 

Ibz{B\ P ) :=£ 
i=i 

In principle, the right-hand side of (3.3) could be rescaled by appropriate normalization factor [10]. The latter is chosen 
with respect to the context. Since the uniform distribution is obtained with the completely mixed state p* = 1 d/d, 
we can rewrite the Brukner-Zcilinger information as 

Ibz(B\p) = C(B\p)-C(B\ P .) . (3.4) 

As we will see, this form is useful in studies of the case with detection inefficiencies. Here, the uniform distribution is 
not a good reference point for comparison. It is also convenient for generalizing the approach to POVM measurements. 
Indeed, for a POVM measurement the number of outcomes typically exceeds dimensionality [27]. 

When the observer have many copies of the same quantum state, he will rather tend to measure the state in 
several mutually complementary bases. For example, the state of spin-1/2 could be measured along one of three 
orthogonal axes. The authors of [10, 23] defined the total information content by summarizing the measures (3.3) 
for all complementary tests. Suppose that we have the set B of d, + 1 MUBs in d-dimensional space. For any density 
matrix p G £ + (Hd), one then gives [37, 38] 


P,(B|P)-1) . 


(3.3) 


Y c ( B \p) = 1 + tr (p 2 ) • ( 3 - 5 ) 

Be b 

Thus, the sum of indices of coincidence is determined by the quantity tr(p 2 ) usually called purity [36]. Then the total 
information is represented as 


Y I bz(B\p ) = tr(p 2 ) - tr(p 2 ) = tr(p 2 ) - ^ . (3.6) 

BeB 

It must be stressed here that this quantity is invariant under unitary transformations of p. When we have a set 
M l = {£«,..., B {L) } of L MUBs, there holds [39] 

Y C(B\p) < - +tr(p 2 ) . (3.7) 

B&K l 

For the case L < d + 1, the sum of L indices of coincidence cannot be determined in terms of purity solely. Hence, we 
can only write the inequality 


Y I bz{B\p) < tr \p 2 ) - tr(p 2 ) . (3.8) 

Be M l 

The left-hand side of (3.8) is generally changed under unitary transformations of p. 

The question about invariance or non-invariance under unitary transformations can be illustrated with the three 
spin-1/2 measurements along mutually orthogonal axes [23]. For one and the same spin state, the three coordinate 
axes may be oriented arbitrarily. Here, the total information (3.6) does not depend on such an orientation. Indeed, 
any axes rotation can be reformulated as a unitary transformation of the given state. The eigenbases of the three Pauli 
observables are mutually unbiased, whence the total information (3.6) is invariant under unitary transformations of 
the state. 

The Shannon entropy is one of the basic notions of information theory. If a measurement is described by the 
probabilities pj(B\p), then the Shannon entropy is written as 

Hi[B\p) :=-Y d j= 1 Pi(B |p) ln Pj (B|p) . (3.9) 

Summing the Shannon measures for all the bases, we obtain some total characteristic. It turned out that such total 
characteristic is generally not invariant under unitary transformations. The authors of [23] clearly exemplified this 
fact with the three spin-1/2 measurements along orthogonal axes. As a total measure of informational character, the 
sum of three Shannon entropies has several counter-intuitive properties [23]. First, it can be different for states of the 
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same purity. Second, it changes in time even for a completely isolated system. Third, it depends on particular details 
of an experimental setup. Even in two dimensions, therefore, the mentioned approach to quantifying information in 
quantum measurements seems to be inappropriate. 

Thus, the sum of the Shannon entropies of generated probability distributions is generally not invariant even for 
the case, when d + 1 MUBs exist. In opposite, the total information (3.6) is constant here. Note that the Brukner- 
Zeilinger information can be interpreted in entropic terms. For 0 < a ^ 1, the Tsallis a-entropy of generated 
probability distribution pj{B\p) is defined by 

n a {B\p) := ^ ('Z'PMpr - l) • (3-10) 

' j =i ' 

This entropy is widely used in non-extensive statistical mechanics due to Tsallis [40]. For a = 2, the corresponding 
Tsallis entropy is connected with the index of coincidence as 

H 2 (B\p) = l-C(B\p). (3.11) 

Hence, we represent the Brukner-Zeilinger information as 

Ibz(B\ P ) = H 2 {B\p.) - H 2 (B\p) . (3.12) 

Thus, the Brukner-Zeilinger measure shows a reduction in the uncertainty due to a deviation of the density matrix 
from the completely mixed one. However, the uncertainty is quantified by the Tsallis entropy of degree a = 2. 


IV. THREE SCHEMES WITH SPECIAL TYPES OF QUANTUM MEASUREMENTS 

In this section, we will discuss use of the Brukner-Zeilinger approach with a SIC-POVM, with a complete set of 
MUMs, and with a general SIC-POVM. In each of these cases, we finally obtain an information measure operationally 
invariant in the terminology of [10]. To apply the result (3.5), we have to perform d + 1 projective measurements, 
if the required MUBs all exist. So, it is interesting to examine the Brukner-Zeilinger total information with other 
quantum measurements. For a POVM A = {A y }, we define 

Ibz(A\p) = C(A\p)-C(A\ P J , (4.1) 

where C{A\p) is the sum of all squared probabilities of the form (2.3). The definition (4.1) is a natural generalization 
of the formula (3.4). 

We first mention that a single POVM measurement is sufficient for our purposes. Suppose that A f — {N^} is 
a symmetric informationally complete POVM in d dimensions. As was shown in [41], the corresponding index of 
coincidence is equal to 


= = < 4 - 2 > 

That is, for a SIC-POVM the index of coincidence is expressed in terms of purity of the given density matrix. For 
the completely mixed state, we have 


C(Af\ P J 


t£ (pi) + 1 

d{d+ 1) 


1 

d? • 


For a SIC-POVM, the Brukner-Zeilinger information is represented as 


Ibz{N\p) 


tr(p 2 ) - tr (pi) 
d{d + 1) 


(4.3) 


(4.4) 


This quantity is merely obtained by dividing the total information (3.6) by d(d + 1). In this regard, the quantity 
(4.4) can also be treated as a measure of total information. It is important since SIC-POVMs could exist for those 
values of d , for which d + 1 MUBs do not exist. Say, for MUBs we do not know the answer already for d = 6, whereas 
the existence of SIC-POVMs has been shown for d < 67 [29]. Of course, any SIC-POVM is more complicated for 
implementation than a single projective measurement. However, we need d+1 projective measurements for calculating 
(3.6). Even if d + 1 MUBs exist, the scheme with them may require more costs than the scheme based on a single 
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SIC-POVM. In this respect, the result (4.4) is also significant. At the same time, constructions of SIC-POVMs for 
sufficiently larger d may rather be complicated. We will further see that the Brukner-Zeilinger concept of total 
information can be developed with d + 1 MUMs and with a general SIC-POVM. These types of measurement are 
interesting in the sense that each of them allows a unified theoretical description. 

For arbitrary d, we can built a set of d + 1 MUMs of some efficiency x < 1 [17]. We shall now consider the 
Brukner-Zeilinger approach with such measurements. Let P be a set d + 1 MUMs of the efficiency x in d-dimensional 
space. As was shown in [21, 22], we then have 


E c{ ?\p) = 1 + 

-per 


1 — x + (xd — 1) tr(p 2 ) 

d- 1 


(4.5) 


For pure states, the right-hand side of (4.5) reads 1 + x. The latter was obtained in [17]. With a set P^ = 
{V w ,... ,V iL) } of L MUMs, we can only write the inequality [22] 

£ ctpIp) < U4 + i-* + (^-i)fV) (4 .6) 

veP L 

Due to (4.5), we have arrived at a conclusion. For the complete set P of d + 1 MUMs of the efficiency x and any 
density matrix p G C+(Hd ), one gives 


E T bz(V\p) = ^ [tr(p 2 ) - tr(p 2 )] . (4.7) 

psp 

The right-hand side of (4.7) increases proportionally to the efficiency parameter x. At the prescribed efficiency, 
the sum of the Brukner-Zeilinger information measures is determined by purity solely. For x = 1, the result (4.7) 
is reduced to (3.6). The latter, however, depends on the existence of a complete set of mutually complementary 
observables. Among other critical points, this fact was mentioned in [42]. On the other hand, a set of d + 1 MUMs 
with some x < 1 has been constructed for arbitrary d [17]. Except for x = 1, mutually unbiased measurements 
are not projective. Together, a set of d + 1 MUMs involves d(d + 1) POVM elements. The scheme with a general 
SIC-POVM seems to be more effective, since it involves only d 2 POVM elements. 

Let us proceed to the case of general SIC-POVMs. It is interesting, since general SIC-POVMs can be built within 
a scheme common for all d [18]. In opposite, a unified approach to constructing SIC-POVMs with rank-one elements 
hardly exists. Moreover, the existence of usual SIC-POVMs for all d is plausible but still not proved. For a general 
SIC-POVM M, we have [20] 

C(MM - ^ ~ " ad> . (4.8) 

where the parameter a characterizes this POVM. Due to (4.8), for any density matrix p G C+(/Hd) we then get 

Ibz(M\p) = ^ [ti'(p 2 ) - tr(p 2 )] . (4.9) 

This quantity expresses the total information associated with the general SIC-POVM A4. For a = 1/d 2 , the result 
(4.9) is naturally reduced to (4.4). Thus, the Brukner-Zeilinger approach to quantifying total information of the given 
quantum state can be realized, at least in principle, with mutually unbiased measurements as well as with a general 
SIC-POVM. 

In this section, we have shown that the Brukner-Zeilinger concept of total information can be realized within the 
three measurement schemes. They are respectively based on a single SIC-POVM, on a set of d + 1 MUMs, and on a 
general SIC-POVM. We are sure in existence of the complete set of MUBs only for specific values of the dimensionality. 
We can also recall that even the case d = 6 is still not understood. For this reason, an alternative realization of the 
Brukner-Zeilinger approach is certainly interesting. On the other hand, implementation of such experimental schemes 
may be not easy due to very special structure of measurement operators. So, the developed approach should take into 
account a role of detection inefficiencies. In this regard, the authors of [43] criticized the Brukner-Zeilinger approach. 
In the next section, we examine the question in more details. 



V. FORMULATION FOR MEASUREMENTS WITH DETECTION INEFFICIENCIES 


In practice, measurement devices inevitably suffer from losses. The authors of [43] considered the Brukner-Zeilinger 
approach in the case of non-zero probability of the no-click event. For definiteness, we first describe this case for 
complementary measurements in MUBs. Let the parameter 77 £ [0; 1] characterize a detector efficiency. The no-click 
event is presented by additional outcome 0 . Assume that for any basis B the inefficiency-free distribution {pj(B\p)} 
is altered as 


p^\B\p) = VPj(B\p) , p { 0 (B\p) = 1 - 77 . (5.1) 

In other words, we mean detectors of the same efficiency for all of the used MUBs. This assumption seems to be 
physically natural and has been adopted in [43]. In essence, the above formulation coincides with the first model of 
detection inefficiencies applied in [44]. On the other hand, the authors of [44] focus on measurements in cycle scenarios 
of the Bell type. 

It was noticed that the Brukner-Zeilinger approach may have some doubts in application to more realistic models 
of the experiment. In principle, we could expect that the total information should vanish with negligible 77 . At a 
glance, however, one comes across an opposite situation. The authors of [43] illustrated this conclusion with the 
three spin-1/2 measurements along orthogonal axes. They calculated the sum of three quantities of the form (3.3) for 
different 77 £ [0; 1] and found the following. First, the minimum of the sum is reached at some intermediate value of 
?7 > 0. Second, for 77 —> 0 + the sum becomes even larger than for the inefficiency-free case 77 = 1 . Such results gave a 
ground for criticizing the Brukner-Zeilinger approach [43]. 

In our opinion, these doubts may be overcome with a proper modification of the form (3.3). Here, we compare 
obtained probability distributions with the uniform one. However, such a comparison is meaningful only in the 
inefficiency-free case 77 = 1. In the distribution (5.1), one of probabilities depends on detectors solely. As its value is 
1 — 77 , the uniform distribution does not have actual bearing for the case 77 < 1. Instead, we propose to compare the 
actual probability distribution with the distribution obtained with the completely mixed input. It is quite reached by 
replacing (3.3) with (3.4). More precisely, for the case of detection inefficiencies we use the quantity 

4z(S|p) = C<*\B\p) - CW(B\p.) = H?\B\p.) - H$>\B\p) . (5.2) 

The superscripts emphasize here that the information measures are all calculated with actual “distorted” probabilities. 
Apparently, preparing the completely mixed state is not difficult. For the existing experimental setup, therefore, 
statistics with the completely mixed input can be observed and stored. Stored data will be used in future for 
applications of the definition (5.2). Thus, we shall consider a more realistic case of detection inefficiencies on the base 
of (5.2). It was shown in [45] that for all a > 0 we have 

H^(B\p) = p a H a (B\p) + M 77 ) , (5.3) 

where Ha\B\p) is the a-entropy of “distorted” distribution (5.1). Of course, the binary entropy h a (j]) is written as 

h a {ri) = ——— ( 77 “ + (1 — 7 ?)“ — l) . (5.4) 

1 — a 

For a = 1, results of the form (5.3) were applied in studying entropic Bell inequalities with detector inefficiencies 
[44]. We will also assume that for a POVM A = {Ay} the inefficiency-free probabilities Pj(A\p) are actually altered 
similarly to (5.1). For a = 2, we then have 


H^(A\p) = v 2 H 2 (A\p) + h 2 ( V ) . (5.5) 

The left-hand side of (5.5) is the entropy calculated with actual measurement statistics. 

We can now reformulate the results (3.6), (4.4), (4.7), (4.9) in the case of detection inefficiencies. It is for this 
reason that we modified definition of the Brukner-Zeilinger information according to (5.2). That is, the terms with 
p * also take into account an influence of no-click events. Combining (3.6) with (5.3) for a = 2, we have arrived at 
a conclusion. When d + 1 MUBs exist and form the set B, the total information with actually observed statistics is 
equal to 


4z(s|p) = p 2 ) - tr (p*)] • 

8gb 


(5.6) 
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When the parameter 77 decreases, the total information also decreases proportionally to the square of 77 . With a 
negligible efficiency of detection, no information about the system could be obtained. This very natural picture 
motivates the proposed definition (5.2). 

Using the described model of inefficiencies, we further obtain the following relations. If a POVM A f is symmetric 
informationally complete then 


jb)f A r\ o) - v 2 U(P 2 ) ~ Mp 2 ) 

l BZ ^\P)~ri d{d+l) ■ 


(5.7) 


This result is obtained by combining (4.4) with (5.5). For the complete set P of d + 1 MUMs of the efficiency x, we 
also rewrite (4.7) as 


£ I'Slmp) = if ^hr Mp») - tr( P ;)]. 

vew 

Due to (4.9), for a general SIC-POVM M we have 

IbI{M\p) = t ? 2 -jjjjL _ (L [tr(p 2 ) - tr(p 2 )] . 


(5.8) 


(5.9) 


The right-hand side of any of the formulas (5.6)-(5.9) monotonically increases with the detection efficiency 77 . Thus, 
criticism related to detection inefficiencies is truly overcome by a proper modification of the definition of the Brukner- 
Zeilinger measure. The idea is that the probability distribution used for referencing should take into account the 
parameter 77 . In principle, the results (5.6)-(5.9) could be adopted for verification of concrete experimental setups 
with respect to their efficiency. Of course, the inefficiency model used is very simple in character. Probably, more 
sophisticated models of detection inefficiencies could be developed. Nevertheless, our discussion has shown that the 
Brukner-Zeilinger approach can quite be placed in the context of real experiments with a limited efficiency. 


VI. MONOTONICITY UNDER THE ACTION OF BISTOCHASTIC MAPS 


We have seen that, for some special measurements, the Brukner-Zeilinger total information can exactly be expressed 
in terms of purity of the quantum state of interest. In effect, the four information measures (3.6), (4.4), (4.7), (4.9) 
are all proportional to the quantity 

tr(p 2 ) - tr(p 2 ) = tr(p 2 ) - ^ . ( 6 . 1 ) 

So, we can treat it as a quantum measure of informational content of the given quantum state. The author of 
[46] showed the following fact. The quantity (6.1) is directly connected with usual quantum-mechanical variance 
averaged over every ortlronornral basis in C s , a XT~Ld)- Hence, the Brukner-Zeilinger concept of invariant information 
is supported within a more traditional point of view. We will provide another interesting interpretation for (6.1). 
This interpretation allows to study monotonicity of the Brukner-Zeilinger information under the action of quantum 
stochastic maps. 

The relative entropy is a very important measure of statistical distinguislrability [35]. In the classical regime, the 
relative entropy is also known as the Kullback-Leibler divergence [47]. Its extension to entropic functions of the Tsallis 
type was discussed in [48, 49]. Let supp(p) C 'H d be the subspace spanned by those eigenvectors that correspond to 
strictly positive eigenvalues of p. This subspace is typically called the support of p [35]. For density operators p and 
er, the quantum relative entropy is expressed as [35] 


Di(p||er) 


tr(p lnp — p lner) , if supp(p) C supp(cr) , 
+00 , otherwise . 


( 6 . 2 ) 


Many fundamental results of quantum information theory are closely related to properties of the relative entropy 
[35, 50], 

The divergence (6.2) was generalized in several ways. To connect the Brukner-Zeilinger approach, we will use 
quantum divergences of the Tsallis type. For a € (1; + 00 ), the Tsallis a-divergence is defined as 


D Q (p|k) 


£31 [tr^er 1 a ) - l] , if supp(p) C supp(er) , 
+00 , otherwise . 


(6.3) 
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For a £ (0; 1), we merely use the first entry without conditions. Up to a factor, this relative entropy is a particular 
case of quasi-entropies introduced by Petz [51]. Quasi-entropies are a quantum counterpart of Csiszar’s /-divergences 
[52]. For more details, see the papers [53, 54] and references therein. It is easy to see that the quantity (6.1) can be 
represented as 

tr(p 2 ) - tr(p 2 ) = i D 2 (p||pJ . (6.4) 

This formula gives a connection of the Brukner-Zeilinger total information with the Tsallis relative entropy. 

One of the basic properties of the quantum relative entropy is its monotonicity under the action of trace-preserving 
completely positive (TPCP) maps [35]. As has been shown, the four information measures (3.6), (4.4), (4.7), (4.9) 
are invariant with respect to unitary transformations. Keeping the measurement setup, we now aim to compare the 
Brukner-Zeilinger measure before and after the action of TPCP maps. For this reason, we will focus on the case of 
the same input and output space. Then Kraus operators of the operator-sum representation (2.16) are expressed by 
square matrices. 

In classical regime, the relative a-entropy of Tsallis’ type is monotone for all a > 0 [49]. Due to non-commutativity, 
the quantum case is more complicated in character. The quantum a-divergence (6.3) is monotone under the action 
of TPCP maps for a £ (0; 2], That is, for a £ (0; 2] and arbitrary TPCP map $ we have 

D Q (<F(p)||$(cr)) < D a (/j||cr) . (6.5) 

This claim is based on the general approach of [54] and the following results of matrix analysis. The function £ i —> 
is matrix concave on [0; +oo) for 0 < a < 1 and matrix convex on [0; + 00 ) for 1 < a < 2 (see, respectively, theorems 
4.2.3 and 1.5.8 in [55]). 

Bistochastic maps form an important class of TPCP maps. Recall that we consider the case of the same input and 
output space. Taking arbitrary operators X, Y £ C(Hd), the adjoint map is defined by [26] 

($(X),Y) hs = (X, 4>' i '( y ))hs • (6.6) 

For the completely positive map (2.16), its adjoint is represented as 

* t (X)=E-K. f XK i . (6.7) 

If this adjoint is trace preserving, then Kraus operators of $ also obey 

^ K K 11-/ • (6.8) 

If a quantum map is completely positive and its Kraus operators satisfy both (2.17) and (6.8) the map is called 
bistochastic [36]. Bistochastic maps can be treated as a quantum counterpart of bistochastic matrix, which act in the 
space of probability vectors. The principal fact is that the completely mixed state is a fixed point of any bistochastic 
map, namely 

$(p») = P * • (6-9) 

This property is referred to as unitality of the map [35, 55]. Combining (6.5) with (6.9), we have arrived at a 
conclusion. For a £ (0; 2] and all bistochastic maps $ : £(7id) £(Hd), one gets 

D a($(p)||p*) < D Q (p||p*) . (6.10) 

We will use (6.10) for a = 2. Thus, the quantity (6.4) cannot increase under the action of bistochastic maps. In other 
words, for bistochastic maps we write 

tr($(p) 2 ) - tr(p 2 ) < tr(p 2 ) - tr(p 2 ) . (6.11) 

Due to (6.11), we see that the quantities (3.6), (4.4), (4.7), (4.9) can only decrease under the action of bistochastic 
maps. As was recently shown in [56], bistochastic quantum operation can only increase quantum entropies of very 
general class. 

Since the four measures (3.6), (4.4), (4.7), (4.9) depend on purity of the state, they are all invariant with respect to 
unitary transformation. In the terminology of [10], they are all operationally invariant measures of information. The 
unitary invariance has been treated as one of basic reasons for using just this approach to quantification of information 
in quantum measurements. Further, the above information measures cannot increase under the action of bistochastic 
maps. For a bistiochastic map, its adjoint is a TPCP map as well. Here, the property (6.9) plays a key role. Quantum 
fluctuation theorems form another direction, in which unitality seems to be very important. As was claimed in [57], 
unitality replaces microreversibility as the restriction for the physicality of reverse processes. Significance of unitality 
or non-unitality of quantum stochastic maps deserves further investigations. In the next section, we will discuss some 
relations between this question and the Brukner-Zeilinger total information. 
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VII. NON-UNITAL MAPS AND THE BRUKNER-ZEILINGER APPROACH 

We have seen that the quantity (6.1) can only decrease under the action of bistichastic maps. It is natural to expect 
that (6.1) may be increased for non-unital quantum operations. In this section, we will study connections of the 
Brukner-Zeilinger total information with characterization of such maps. The latter seems to be closely related with 
quantum fluctuation theorems. Recent advances in dealing with small quantum systems have led to growing interest 
in their thermodynamics [58]. A certain progress has been connected with studies of the Jarzynski equality [59] and 
related fluctuation theorems [60, 61]. Recent studies are mainly concentrated on formulations for open quantum 
systems [62-66]. Some of such results have been shown to be valid in the case of bistochastic maps [57, 67]. Jarzynski 
equality for quantum stochastic maps can naturally be formulated in terms of the non-unitality operator [ 68 ]. It turns 
out that norms of this operator can be evaluated within the Brukner-Zeilinger approach. 

Operators of interest are often characterized by means of norms. Some of them are especially important. To each 
X G C{7id), we assign |X| G C + (Hd) as the unique positive square root of X^X. The eigenvalues of |X| counted with 
multiplicities are the singular values of X denoted by Sj(X). For q G [1; oo], the Scliatten < 7 -norm is defined as [26] 

ll x ll« := (E- =a s J (x ) , ) 1/a - (7 ' 1} 

1/2 

This family includes the trace norm ||X||i = tr|X| for q = 1, the Hilbert-Schmidt norm 11X ||2 = (X, X) H ' g for q = 2, 
and the spectral norm 


IIXIloo = max{sj(X) : 1 < j < d} (7.2) 

for q = 00 . These norms are widely used in quantum information theory. They also give a tool for characterizing 
linear maps. For a linear map <k, its norm is defined as 

||$||:=sup{||$(X)|| 00 : IIXIloo = 1} • (7.3) 

We will use the following fact proved, e.g., in item 2.3.8 of [55]. If a map $ : C(7id) —> £( ~Hd ) is positive, then 

INI = IIXIloo . (7.4) 

In terms of the completely mixed state, we write ||4>|| = d ||$(p„)|| 0O . 

For a linear map $ : C.(Hd) —> JCi'Hd), the non-unitality operator is written as [ 68 ] 

G*:=$(pj-p, . (7.5) 

This operator is zero for all bistochastic maps. For TPCP maps, the Hilbert-Schmidt norm of (7.5) is immediately 
expressed in terms of the Brukner-Zeilinger measure of information. Indeed, the squared Hilbert-Schmidt norm of 
G$ is written as 


($(ld) - Id, $(ld) - ld) HS = tr($(ld) 2 ) - 2 tr($(l d )) + d = tr($(l d ) 2 ) - d . (7.6) 

Here, we recall that $(1^) G C+(Hd) is Hermitian and tr($(ld)) = d due to preservation of the trace. Dividing (7.6) 
by d 2 and taking the square root, for a TPCP map we have 

IIM 2 = ^tr($(pj 2 ) - 1 /d = y^tr($(p*) 2 ) -tr(p 2 ) . (7.7) 

Thus, obtaining the Brukner-Zeilinger total information allows also to calculate the Hilbert-Schmidt norm of the 
non-unitality operator. 

The difference tr(<&(p„,) 2 ) — tr(p 2 ) can be evaluated by means of measurements schemes described in Sections III 
and IV. When an unknown quantum channel is given as some black box, we prepare the completely mixed state with 
putting it into the black box. The output $(p*) is further subjected to one of measurement schemes available for the 
given d. This run is repeated as many times as required for collecting measurement statistics. Statistical data should 
be sufficient for evaluation of the left-hand side of one of the relations (3.6), (4.4), (4.7), and (4.9). Thus, we obtain 
the quantity (6.1) for p = $(p„) and apply (7.7). 

Using the result (7.7), for quantum operations we can estimate from above the map norm (7.4). We will use a 
relation between vector norms proved in [41]. It was later applied for deriving fine-grained uncertainty relations for a 
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set of MUBs and a set of MUMs [69]. As follows from the results of appendix A of [41], for any operator X G C.(Hd) 
we have 


IIXIloo < \ (j|X||i + Vd^ly/d ||X||1 - ||X|| 2 ) . (7.8) 

For a TPCP map, we have ||$(p,)||i = tr(<f>(p„)) = 1. Combining this with (7.8) gives 

IISGOIloo < \ (l + v / d^l v'dll^PJIIl-l) • (7-9) 

Due to (7.7) and ||^(p*)|| 1 = tr($(p^) 2 ), multiplying (7.9) by d leads to 

||$|| < 1 + y/d(d-l) ||G«|| 2 . (7.10) 

Thus, for quantum operations the map norm (7.3) is bounded from above in terms of the Hilbert-Schmidt norm of 
the corresponding non-unitality operator. For bistochastic maps, we have ||4>|| = ||ld||oo = 1 and G$ = 0, so that the 
inequality (7.10) is saturated here. 

The above findings can further be illustrated with the following example. Let {|*)} i _ 1 be an orthonormal basis in 
Hd- We consider the quantum operation T : C(7id) —> C^Hd) with Kraus operators 

K, = |i 0 )<t| , (7.11) 

where |*o) is some prescribed state of the basis. This map represents the complete contraction to a pure state. Taking 
|i 0 ) as a ground state, one can describe the process of spontaneous emission in atomic physics. In a certain sense, 
quantum operations of the form (7.11) enjoy extreme non-unitality. The condition (2.17) is clearly satisfied, whereas 

*(!*) = , K i K l t=d|i 0 )<io| • (7.12) 

In this example, we have ^(p*) = |*o)(*o| and tr(\l/(p,J 2 ) —tr(p 2 ) = 1 — 1/d. Hence, the Brukner-Zeilinger information 
reaches its maximal value. We also note that the inequality (7.10) is saturated here. Indeed, substituting the term 
||||2 = i/l — 1/d into the right-hand side of (7.10) results in the value d that is exactly ||'P|| = ||T(ll d )|| 00 . Thus, 
quantum operations of the form (7.11) show a behavior quite opposite to bistochastic maps. 


VIII. CONCLUSION 

We have considered the Brukner-Zeilinger approach to quantifying information in quantum measurements on a 
finite-level system. This problem is essential due to recent advances in quantum information processing. The original 
formulation of Brukner and Zeilinger was based on projective measurements in the complete set of MUBs. This 
formulation is therefore restricted, since even the case of MUBs in dimensionality 6 is still not resolved [12]. We have 
shown that the idea of operationally invariant measure of information can truly be realized within the three schemes 
based on special types of quantum measurements. Namely, these schemes respectively use a single SIC-POVM, a 
complete set of MUMs, and a single general SIC-POVM. Such measurements are easy to construct. In addition, 
costs on the schemes with a single SIC-POVM may be less. The Brukner-Zeilinger measure of information was also 
criticized on the following ground. In real experiments, the “no-click” events inevitably occur. Some doubts in the 
case of detection inefficiencies were discussed in [43]. Such criticism is overcome by means of natural reformulation 
of the approach considered. Namely, the uniform distribution is a good reference only for the inefficiency-free case. 
Otherwise, we should use for comparison some probability distribution that takes into account a real efficiency of 
detectors. The desired probability distribution is naturally obtained by putting the completely mixed state into real 
experiments. The corresponding data can be stored and further used for calculating required quantities. Information 
measures of the Brukner-Zeilinger type are not only unitarily invariant, they cannot also increase under the action 
of bistochastic maps. Using this approach for characterization of non-unital TPCP maps is considered. If a quantum 
channel is given as black box, the measurement schemes described can be used for determining the Hilbert-Schmidt 
norm of the non-unitality operator. Potential applications of information measures of the Brukner-Zeilinger type in 
quantum information science deserve further investigations. The authors of [70] recently proposed the constructor 
theory of information, which is aimed to derive the properties of information entirely from the laws of physics. It 
would be interesting to study measures of information in quantum theory within the constructor theory. 
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